Goto

Collaborating Authors

 greedy feature construction


Greedy Feature Construction

Neural Information Processing Systems

We present an effective method for supervised feature construction. The main goal of the approach is to construct a feature representation for which a set of linear hypotheses is of sufficient capacity -- large enough to contain a satisfactory solution to the considered problem and small enough to allow good generalization from a small number of training examples. We achieve this goal with a greedy procedure that constructs features by empirically fitting squared error residuals. The proposed constructive procedure is consistent and can output a rich set of features. The effectiveness of the approach is evaluated empirically by fitting a linear ridge regression model in the constructed feature space and our empirical results indicate a superior performance of our approach over competing methods.


Reviews: Greedy Feature Construction

Neural Information Processing Systems

The paper is well written, and easy to understand. My biggest concern is a lack of comparison to other related approaches, such as single index models. It seems to be that the method you propose is somewhere between SIM and kernel methods, and while you do better than the latter, the former might be better (in terms of error, but slower algorithmically or need more data). So a comparison is warranted - Line 17: here and elsewhere, you use the term "capacity". Can you make the notion of capacity precise?


Greedy Feature Construction

Oglic, Dino, Gärtner, Thomas

Neural Information Processing Systems

We present an effective method for supervised feature construction. The main goal of the approach is to construct a feature representation for which a set of linear hypotheses is of sufficient capacity -- large enough to contain a satisfactory solution to the considered problem and small enough to allow good generalization from a small number of training examples. We achieve this goal with a greedy procedure that constructs features by empirically fitting squared error residuals. The proposed constructive procedure is consistent and can output a rich set of features. The effectiveness of the approach is evaluated empirically by fitting a linear ridge regression model in the constructed feature space and our empirical results indicate a superior performance of our approach over competing methods.